A Survey on Machine Learning Methods in Spam Filtering
Abstract
Email spam, or junk e-mail (unwanted e-mail “usually of a commercial nature sent out in bulk”), is one of the major issues of today's Internet, causing financial damage to companies and annoying individual users. Among the approaches developed to stop spam, filtering is an important and popular one. Common uses for mail filters include organizing incoming email and removing spam and computer viruses. A less common use, at some companies, is inspecting outgoing email to ensure that employees comply with appropriate laws. Users might also employ a mail filter to prioritize messages and to sort them into folders based on subject matter or other criteria. Mail filters can be installed by the user, either as separate programs or as part of their email program (email client). In email programs, users can create personal, "manual" filters that then automatically filter mail according to the chosen criteria. Most email programs now also have an automatic spam filtering function. In this paper, we present a survey of the performance of four commonly used machine learning methods in spam filtering.
Similar resources
Spam Filtering Methods and machine Learning Algorithm - A Survey
Social networking websites are used by millions of people around the world. People express their views and opinions and share current topics. Millions of data items are generated every day. It is a good platform for connecting with people. Nowadays, spammers use this platform to advertise spam content on social networking websites. The proposed system is used to classify tweets into different groups as s...
Learning Spam: Simple Techniques For Freely-Available Software
The problem of automatically filtering out spam e-mail using a classifier based on machine learning methods is of great recent interest. This paper gives an introduction to machine learning methods for spam filtering, reviewing some of the relevant ideas and work in the open source community. An overview of several feature detection and machine learning techniques for spam filtering is given. T...
Advances in Online Learning-based Spam Filtering
The low cost of digital communication has given rise to the problem of email spam, which is unwanted, harmful, or abusive electronic content. In this thesis, we present several advances in the application of online machine learning methods for automatically filtering spam. We detail a sliding-window variant of Support Vector Machines that yields state of the art results for the standard online ...
A Machine Learning Approach to Server-side
Spam-detection systems based on traditional methods have several obvious disadvantages, such as low detection rates, the need for regular knowledge-base updates, and impersonal filtering rules. New intelligent methods for spam detection, which use statistical and machine learning algorithms, solve these problems successfully. But these methods are not widespread in spam filtering for enterprise-level ...
The Fight against Spam - A Machine Learning Approach
The paper presents a brief survey of the fight between spammers and antispam software developers, and also describes new approaches to spam filtering. In the first two sections we present a survey of the currently existing spam types. Some well-mapped spammer tricks are also described, although the imagination of spam distributors is endless, and therefore only the most common tricks are covere...